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Corn leaf disease possesses a huge impact on the food industry and corn 
crop yield as corn is one of the essential and basic nutrition of human life 
especially to vegetarians and vegans. Hence it is obvious that the quality of 
corn has to be ideal, however, to achieve that it has to be protected from the 
several diseases. Thus, there is a high demand for an automated method, 
which can detect the disease in early-stage and take necessary steps. 
However, early disease detection possesses a huge challenge, and it is highly 
critical. Thus, in this research work, we focus on designing and developing 
enhanced k-nearest neighbor (EKNN) model by adopting the basic 
k-nearest neighbour (KNN) model. EKNN helps in distinguishing the 
different class disease. Further fine and coarse features with high quality are 
generated to obtain the discriminative, boundary, pattern and structural 
related information and this information are used for classification 
procedure. Classification process provides the gradient-based features of 


high quality. Moreover, the proposed model is evaluated considering the 
Plant-Village dataset; also, a comparative analysis is carried out with 
different traditional classification model with different metrics. 


This is an open access article under the CC BY-SA license. 


Corresponding Author: 


Daneshwari Ashok Noola 

Department of Information Science and Engineering, Bijapur Liberal District Education Association 
Vachana Pitamaha Dr. P. G. Halakatti College of Engineering and Technology 

Vijayapura, India 

Email: ise.daneshwari@bldeacet.ac.in 


1. INTRODUCTION 

Several smart technological developments and utilization of machine learning techniques in recent 
years have revolutionized various fields like electronic media, medical, engineering, defense, and agriculture 
[1]. Among these fields, agriculture is drastically developed in recent years and one of the most benefitted 
fields from these smart technological developments [2]. As agriculture is the major economic source of the 
country, the utilization of smart agriculture techniques can provide a major boost to the economy of a 
country. Besides, there are several crops which can heavily impact the economy of the country. These crops 
are majorly produced in the country and even exported outside the country as well. One of those crops is corn 
which plays a significant role in agriculture field [3]. Corn is an essential traditional crop [4] which is utilized 
as human food, animal feed and also used as a raw material in several industries. The quality of corn kernels 
majorly depends on the crop yield and corn production [5]. A fast and effective technique is required to 
assess the quality of corn kernels [6]. However, several leaf diseases occur in the corn crop, can heavily 
impact the quality of crop [7] as well as yield. Moreover, the production rate of the corn crop is also affected 
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heavily due to plant leaf diseases [8]. Therefore, the precise identification of leaf diseases is very essential to 
maintain the production rateand crop yield. 

In [7], a corn disease identification method is adopted using the classification of leaf diseases based 
on support vector machine (SVM) classifier. A detailed study on segmentation of leaf diseases is also 
presented. In [8], a CNN architecture is adopted for corn leaf disease identification which enhances automation 
and digitalization in agriculture. Here, convolutional neural network (CNN) architecture enhances the 
classification accuracy of leaf diseases. In [9], an improvised machine learning technique is presented for the 
detection of crop leaf diseases. Here, handcrafted features are adopted which gives information about 
histogram-oriented gradient (HOG), segmented fractal texture analysis (SFTA) and local ternary patterns 
(LTP). In [10], a feature enhancement and a robust AlexNet technique arepresented for maize leaf disease 
identification. Modified neural network architecture is designed based on robust AlexNet technique. 
However, potential plant leaf disease detection techniques are not mature enough to use in practical 
applications. Several challenges need to be addressed like precise identification of plant leaf diseases, 
identification of various factors which can affect crop production and quality of corn and efficient feature 
extraction for the identification of disease type [11]. 

In this study, an enhanced k-nearest neighbour (EKNN) classifier is utilized for the precise detection 
ofcrop leaf diseases. Here, the detection of the leaf disease process is segregated into four phases using the 
proposed EKNN model. Moreover, the first phase describes the pre-processing process which filters the 
availability of noise from leaf images. Further, phase 2 discusses the segmentation process of leaf lesions to 
identify lesion boundaries and pattern related information. Then, phase 3 discusses the proposed feature 
extraction process to extract structure-related information corn leaf images. Finally, leaf classification is 
conducted on obtained features for the detection of leaf diseases using proposed EKNN model. This article is 
presented in the following manner. Section 2 describes the mathematical modelling of proposed corn leaf 
classification modified KNN technique. Section 3 describes the experimental results and their comparison 
with traditional leaf classification techniques and section 4 concludes the article. 


2. MODELLING OF PROPOSED ENHANCED KNN CLASSIFIER 

The health of the maize plant itself can be represented by the leaves [12]. Plants suffering from 
disease usually have yellowish or brownish leaves, as well as patches or decaying areas. This data is 
something we'd like to extract from image data [13]. During feature extraction process, features are 
considered to be discriminative that have low dimensions [14], and robust against change in data. Red, green, 
and blue (RGB) characteristics are extensively employed in image processing and pattern recognition to 
extract color information. RGB is highly recommended for object detection in images with substantial color 
variations [15]. Think of the RGB color as all the different colors that can be created using three different 
colored lights for red, green, and blue. The RGB values ranged from 1 to 255. We are going to normalize 
them from 0 to 1 in this task. The Figure 1 indicates the different diseases on corn leaf. 


Figure 1. Corn leaf disease types 


Digital image processing mechanism has been adopted recently for leaf disease identification; also, 
the structure is an eminent feature for pattern recognition [16]. Nevertheless, structure related feature provides 
precise information regarding crucial feature extraction such as shape, object pattern and boundaries. 
Moreover, structure-related information is classified into two distinctive categories for absolute leaf disease 
detection; the first category gives information regarding fine feature and second category provides an 
information about coarse features [17]. The architecture of the proposed model is given in Figure 2. 
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Figure 2. Proposed model architecture 


The feature extraction process divides the image into a huge number of feature vectors, each of 
which is unaffected by image translation, scale, or rotation, but is affected by lighting changes and local 
geometric distortions [18]. These characteristics are similar to neurons in the primary visual cortex [19], 
which are responsible for primate vision's basic forms, colors, and motions for object detection. Based on the 
Euclidian distance of the feature vector, this technique looks for candidates from related features. The fine 
features are considered using (1). 

Let us consider an area of pixels (central) in one image M, hence Fine-the feature can be given (1): 


fine_featr,u = Luz 2" j (Stu — So) (1) 


In the (1) So and S¢,y indicates the greyscale value of area of pixel (AoP) of central and u‘ adjacent 


pixels on a circle that has radius as t. Moreover, the whole adjacent pixels are given as U that is performed 
using (2). 


: 1, >0 
9) ={5 $0 Q) 


Further rotational constant fine feature is given in (3). 


, {Ltj (Su — So), E(fine_featruyu) < 2 
ine feat, = Su =o KIRN 3 
fose feats y U+41, otherwise (a) 


In the (3), V represents a rotational constant form of fine_feat and obtained fine features are uniform 
with E < 2. Here, D is uniformity evaluator, which is defined by (4), 


E(fine_featry) = Zizo lj (Su — So) —i(Su-1 — So)+1f(Su-1 — So) -CS0 — So) (4) 


Furthermore, sign and magnitude component of given filter response are designed through a given equation; 
also multi-scale histogram of same is represented through (5). 


Cu = |Seu — Sol 
Jtu = Stu = So), (5) 


Nevertheless, the components in (5) are encoded to achieve a histogram representation of multi-scale 
histogram representation and can be represented using (6). 


coarse_featr y = PUZ 2". j(Cu — 0) (6) 


In the (6), in case of an input image bt the average value is expressed through n; further central AoP is 
achieved are encoded with the (7), where nj indicates central pixels value. 


coarse_featry = j(So — Ox) (7) 


Further, confined intensity directional order relation (DOR) is adopted for defining the intensity relationship 
among nearest pixels through gradient features; further confined intensity-DOR uses the confined ordinal 
data for determining the intensity relation among neighbouring pixel for each central AoP. Moreover, 
direction set (DS) based encoding strategy is used for segregating the neighbouring pixel to multisets pixel 
through pointing towards one direction and maintaining the rotational invariance. 

Later, encoding is carried out through set wise intensity; furthermore, it is very important to identify 
the dominant direction through changing the greyscale value [20] for each pixel to a given average greyscale 
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of provided shaped regions of the given image. Forgiven image N and central AoP y, average greyscale is 
given through (8). 


So = P(Sow) (8) 
Stu = O(Stuw) (9) 


In the (9), arbitrarily shaped of the size wxw around the AoP with notation zcan be presented as So,w where 
arbitrarily shaped regions with neighbouring pixel sut”. Further, p(-) indicates the average greyscale of the 
arbitrary. Hence, this model improvises robustness in terms of noise and furtherit gives the extended structure 
for the leaf disease. Later neighbouring pixels are flipped through pointing to the specific dominant direction 
for developing the rotational constant. Moreover, the dominant direction is considered as neighbouring pixel 
index where the difference from AoP is on a higher value, and it is shown in the (10). 


C=arg max — to (10) 


t 
vefo, 1. V-11} 


In the (10) indicates class label of histogram and proposed model identifies the class among all the class that 
have the biggest summation of adjacent pixels. Further, above equation employs the evaluation coefficient 
for identifying the neighbour pixels and further feature set through class labels. Moreover, high classification 
is obtained through the various class label for detecting the leaf disease in corn leaves. 


3. RESULTS AND DISCUSSION 

In this section of the research, we evaluate proposed methodology considering the various parameter; 
further proposed model is compared with various existing methodology. Moreover, different parameter like 
classification accuracy, precision and area under curve (AUC) are considered for the comparative 
analysis [21]. In general corn leaf identification follows the four distinctive steps, the first method is 
pre-processing which includes the noise reduction and faster processing [22]. Second step includes the 
segmentation process where disease area is identified through the prominent leaf lesion and accurate 
boundaries; this process has been carried out in the previous research work. In third step high quality, fine 
and coarse features are extracted toachieve the high quality. 

Moreover, an adequate number of image sets are used for the training the proposed model for leaf 
disease evaluation; plant village dataset is used, and leaf disease are taken from the dataset [23]. Plant village 
dataset is parted into four distinctive diseases i.e., healthy classes, cercospora-leafspot gray, northern leaf 
blight and common rust [24]. Each class comprises several numbers of leaf disease that are depicted in 
Table 1, also each images have the pixel resolutions of 256x256, and entire project is simulated through 
MATLAB. Moreover, high-quality features are achieved through proposed model and classification process 
is carried out on the extracted features. Proposed models identify he disease presence in the leaves and 
determines the particular class of disease where it belongs. 


Table 1. Dataset details 


Class Total images 
HL 1162 
LSG 513 
NLB 985 
RS 1160 
Total images 3820 
Class Total images 


3.1. Perfomance metrics 

In this section, we evaluate the model considering the different deep learning metrics like accuracy, 
sensitivity, specificity, and AUC. Table 2 shows the different metrics; in here EKNN observes massive value 
of accuracy, sensitivity, specificity, and AUC of 99.86, 99.60, 99.88, and 99.75 respectively. The process of 
calculating the performance metrics is calculated as: 
— The number of cases correctly diagnosed as disease is called a true positive (TP), 
— The number of false positives (FP) is the number of cases that are wrongly diagnosed as disease, 
— The number of instances correctly classified as healthy is called a true negative (TN), 
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— The number of instances wrongly diagnosed as healthy is called a false negative (FN). 


Table 2. Observed performance metric of EKNN 


Performance metrics Value 


Accuracy 99.86 
Sensitivity 99.6 
Specificity 99.88 

AUC 99.75 


3.1.1. Accuracy 

A test's accuracy is determined by its capacity to appropriately distinguish between disease and 
healthy cases. We should determine the fraction of true positive and true negative in all analyzed cases to 
estimate a test's accuracy [25]. This can be expressed mathematically as: 


Accuracy = TP+TNTP+TN+FP+FN 


3.1.2. Sensitivity 
A test's sensitivity refers to its capacity to appropriately identify disease cases. Calculate the fraction 
of true positives in leaf disease instances to estimate it. This can be expressed mathematically as: 


Sensitivity = TPTP + FN 


3.1.3. Specificity 
A test's specificity refers to its capacity to appropriately identify healthy instances. Calculate the 
fraction of true negatives in healthy instances to estimate it. This can be expressed mathematically as: 


Specificity = TNTN + FP 


To compare two state of arttechnique is considered i.e., AlexNet and DMS-Robust AlexNet. In Table 3, first 
column presents the model’s name, second column present the disease category, third, fourth and fifth 
column presents precision, recall, and F1. To extract the pixels and features from the image, the corn leaf 
image undergo segmentation for accurate pixel extraction. The process of image segmentation levels is 
indicated in Figure 3. 

In Table 3, HL represents the healthy leaf, LSG represents the gray leaf spot, NLB represents North 
leaf blight and RS indicates common rust observed in the leaf. Moreover, for healthy leaf disease proposed 
model observes precision, recall and F1-score of 99.7, 99.94 and 99.82 respectively. In case of gray leaf spot in 
corn leaf, EKNN observes 99.87, 99.1 and 99.48 percentage of precision, recall and Fl-score respectively. 
Further, for north leaf blight disease EKNN achieves 99.48 percentage of precision, recall and Fl-core; 
similarly forcommon rust disease, EKNN model achieves precision of 99.77, recall of 99.88 and Fl-score of 
99.83. 

Figure 4 shows ROC receiver operating characteristics of EKNN model for the four distinctive types 
ofleaf i.e., healthy leaf, gray leaf spot, North leaf blight and common rust. Further, ROC is plotted between 
true positive rate and true negative rate; also, different color is used to distinguish the other leaf disease. The 
disease detection accuracy of the proposed and existing models is represented in Figure 5. The results 
indicate that the proposed model accuracy is more than the traditional methods. 


Table 3. Comparison of EKNN with various existing methodology 
Model Category Precision Recall F1 
AlexNet HL 95.31 95.31 
LSG 94.39 94.28 
NLB 93.51 94.46 
RS 92.72 94.87 
DMS-Robust AlexNet HL 99.2 98.44 
LSG 99.16 98.21 
NLB 98.03 98 
RS 98.95 98.47 
EKNN HL 99.7 99.94 
LSG 99.87 99.1 
NLB 99.48 99.48 
RS 99.77 99.88 
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Figure 3. Image segmentation levels 


0.8 


boag 
a 


bag 
eS 


True Positive Rate 


0.2 


0.0 


T T 
0.0 0.2 0.4 0.6 0.8 1.0 
False Positive Rate 


Figure 4. ROC curve using proposed EKNN model for multi-class classification 
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Figure 5. Disease detection accuracy levels 


4. CONCLUSION 

Early detection of corn leaf is considered eminent due to the major utilization of corn crop. Hence, 
enhanced-KNN is designed for absolute identification of corn disease and further distinguish the classes of 
disease. Moreover, EKNN follows the enhanced mathematical modelling for achieving the high-quality 
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features. In here, fine, and coarse feature are achieved through the proposed model to improvise the 
classification accuracy. Further, confined intensity-DOR is adopted to achieve the low—dimensional through 
intensity relationship optimization among the neighbouring pixels. Furthermore, an optimized mechanism 
strategy named Directional set is developed for segregating the neighbouring pixels into the various sets 
through pointing into the specific direction. Proposed model EKNN is evaluated considering the various 
traditional mechanism and existing mechanism. Moreover, proposed model achieves observes massive value 
of accuracy, sensitivity, specificity, and AUC of 99.86, 99.60, 99.88, and 99.75 respectively. Further 
comparative analysis is carried out in terms of precision, recall and F1 score. 
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